# Multilingual Speech Synthesis

Llasa 3B
Llasa is a text-to-speech (TTS) system based on LLaMA, which extends the capabilities of the language model by integrating speech tokens, supporting Chinese and English speech generation.
Speech Synthesis Supports Multiple Languages
L
unsloth
55
1
Handler
MIT
Bark is a Transformer-based text-to-audio model created by Suno, capable of generating highly realistic multilingual speech, music, background noise, and sound effects.
Speech Synthesis Supports Multiple Languages
H
walterheart
20
0
Hindispeech
IndicF5 is a near-human level multilingual text-to-speech (TTS) model supporting 11 Indian languages.
Speech Synthesis Other
H
ShriAishu
31
0
Epxtts
Other
viⓍTTS is a voice generation model capable of cloning voices into different languages using a 6-second short audio clip.
Speech Synthesis Other
E
epchannel
22
0
Indicf5
IndicF5 is a near-human multilingual text-to-speech (TTS) model trained on 1,417 hours of high-quality speech data, supporting 11 Indian languages.
Speech Synthesis Other
I
ai4bharat
6,595
37
Speecht5 Finetuned Voxpopuli Lt
MIT
A text-to-speech model fine-tuned on the VoxPopuli dataset based on microsoft/speecht5_tts
Speech Synthesis Transformers
S
hungphan111
19
0
Kokoro 82M
Apache-2.0
Kokoro is an open-source TTS model with 82 million parameters, delivering audio quality comparable to larger models while offering significant speed advantages and cost efficiency.
Speech Synthesis English
K
prince-canuma
376
2
Yarngpt2
Apache-2.0
YarnGPT2 is a text-to-speech (TTS) model specifically designed for synthesizing Nigerian-accented languages (Yoruba, Igbo, Hausa, and English).
Speech Synthesis Transformers English
Y
saheedniyi
2,023
4
Cosyvoice2 0.5B
CosyVoice is a text-to-speech (TTS) model that supports multilingual and voice conversion capabilities, providing high-quality speech synthesis.
Speech Synthesis
C
FunAudioLLM
4,573
114
Parler Tts Mini Multilingual V1.1
Apache-2.0
Parler-TTS Mini Multilingual v1.1 is a multilingual extension based on the Parler-TTS Mini version, supporting text-to-speech in 8 European languages.
Speech Synthesis Transformers Supports Multiple Languages
P
parler-tts
3,020
32
Indri 0.1 350m Tts
Indri is a novel, ultra-small, lightweight TTS model based on the Transformer architecture, supporting text-to-speech tasks in English and Hindi.
Speech Synthesis Transformers Supports Multiple Languages
I
11mlabs
1,088
0
GPT SoVITS V1 Base
MIT
GPT-SoVITS (V1) is a multilingual text-to-speech foundation model supporting Chinese, English, and Japanese.
Speech Synthesis Supports Multiple Languages
G
None1145
20
1
Indic Parler Tts Pretrained
Apache-2.0
The Indic Parler-TTS Pretrained Model is a multilingual Indian language extension of Parler-TTS Mini, supporting 21 languages, including various Indian languages and English.
Speech Synthesis Transformers Supports Multiple Languages
I
ai4bharat
1,102
8
Indic Parler Tts
Apache-2.0
Indic Parler-TTS is a multilingual extension of Parler-TTS Mini, supporting 21 languages including various Indian languages and English.
Speech Synthesis Transformers Supports Multiple Languages
I
ai4bharat
43.59k
124
Vits Ar Sa A
This is a Transformers-based Text-to-Speech (TTS) model capable of converting input text into natural speech output.
Speech Synthesis Transformers
V
wasmdashai
227
2
Cosyvoice 300M SFT
CosyVoice is a text-to-speech (TTS) model that supports multilingual and multi-style voice synthesis.
Speech Synthesis
C
FunAudioLLM
1,768
13
Speecht5 Tts Urdu
MIT
A Urdu text-to-speech model fine-tuned on Microsoft's SpeechT5 architecture, supporting Romanized input
Speech Synthesis Transformers Other
S
hisanusman
15
0
Ttsvi
Other
viⓍTTS is a voice generation model that supports voice cloning in 18 languages, with special optimization for Vietnamese.
Speech Synthesis Transformers Other
T
ntdgo
41
9
Mms Tts Tuk Script Latin
A Turkmen text-to-speech model developed by Meta, part of the Massively Multilingual Speech project, supporting speech synthesis for Turkmen written in Latin script.
Speech Synthesis Transformers
M
facebook
51
2
Mms Tts Cat
Catalan text-to-speech model developed by Meta, utilizing the VITS end-to-end architecture for high-quality speech synthesis
Speech Synthesis Transformers
M
facebook
204
1
Mms Tts Ben
Bengali text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Transformers
M
facebook
1,686
3
Mms Tts Bem
Bemba (bem) text-to-speech model developed by Meta, part of the Massively Multilingual Speech project
Speech Synthesis Transformers
M
facebook
67
1
Mms Tts Som
A Somali text-to-speech model developed by Meta as part of the MMS project, supporting the conversion of Somali text into natural speech.
Speech Synthesis Transformers
M
facebook
592
4
Mms Tts Kek
Kekchi text-to-speech model developed by Meta, part of the Massively Multilingual Speech project
Speech Synthesis Transformers
M
facebook
20
0
Mms Tts Ory
Odia text-to-speech model from Facebook's MMS project, achieving high-quality speech synthesis based on the VITS architecture
Speech Synthesis Transformers
M
facebook
148
4
Mms Tts Lat
Latin text-to-speech model developed by Meta, based on VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Transformers
M
facebook
90
2
Mms Tts Ron
Romanian text-to-speech model developed by Meta, utilizing VITS architecture for high-quality speech synthesis
Speech Synthesis Transformers
M
facebook
3,822
4
Mms Tts Tgl
An end-to-end text-to-speech model for Tagalog developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Transformers
M
facebook
1,579
3
Mms Tts Ese
An Ese Ehue text-to-speech model developed by Meta as part of the Massively Multilingual Speech project, supporting high-quality speech synthesis.
Speech Synthesis Transformers
M
facebook
48
0
Mms Tts Tel
Telugu text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Transformers
M
facebook
531
7
Mms Tts Tam
Facebook's Massively Multilingual Speech project's Tamil text-to-speech model, implementing high-quality speech synthesis based on the VITS architecture
Speech Synthesis Transformers
M
facebook
1,109
11
Mms Tts Mar
Marathi text-to-speech model developed by Meta, supporting high-quality speech synthesis
Speech Synthesis Transformers
M
facebook
351
3
Mms Tts Hif
A Hindi-Fijian text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Transformers
M
facebook
15
0
Mms Tts Sqi
Albanian text-to-speech model developed by Meta, based on VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Transformers
M
facebook
339
2
Speecht5 Finetuned Common Voice Be
MIT
Belarusian text-to-speech model based on Microsoft SpeechT5 architecture, fine-tuned on the Common Voice dataset
Speech Synthesis Transformers Other
S
KoRiF
27
0
Bark
MIT
Bark is a Transformer-based text-to-audio model created by Suno, capable of generating highly realistic multilingual speech, music, background noise, and simple sound effects.
Speech Synthesis Transformers Supports Multiple Languages
B
suno
35.72k
1,326
Silero Model V3 Ru
Silero Speech Model is a text-to-speech (TTS) model focused on Russian, developed and open-sourced by snakers4.
Speech Synthesis Transformers Other
S
imperialwool
22
4
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase